epoching: more test infra and some fuzz tests on keeper functionalities #60
Conversation
f.Add(int64(11111))
f.Add(int64(22222))
f.Add(int64(55555))
f.Add(int64(12312))
Why bother with these? I thought that when something fails, the fuzzer saves the failing input somewhere so it can act as a regression test.
This is indeed the case when we run fuzz tests individually. However, CI will only run the test cases added by f.Add, without trying other random values generated from the corpus. Would a larger seed corpus be beneficial for CI? If not, I will keep only one of them.
I see, so you want it to run at least a few times. We discussed with @vitsalis that we could add -fuzztime to the CI to run it for a limited number of iterations.
Not sure what you guys think about doing something like f.Add(time.Now().UnixMilli()), so the tests run as unit tests are not always the same and you aren't forced to include this arbitrary-looking boilerplate every time. It would ensure each target runs at least once, and if it fails then hopefully it prints the troublesome seed as well.
The counter-argument was that it makes builds non-deterministic. It can be annoying if random tests fail on some runs and not others, but over a long period of time this would catch more bugs than relying on the same seeds. We could even do something like addRandomSeeds(f, 100) to add exactly 100 random seeds, which is what other frameworks do (i.e. they achieve 100 passes per test by default, rather than running forever like Go).
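A minimal sketch of what such a helper could look like (the names addRandomSeeds and randomSeeds are hypothetical, not from this PR); it also folds in the time-based seed idea from above, while logging the base seed so a failure stays reproducible:

```go
package main

import (
	"math/rand"
	"testing"
	"time"
)

// randomSeeds returns n pseudo-random int64 values derived from one base seed,
// so a failing run can be reproduced from just that base seed.
func randomSeeds(n int, base int64) []int64 {
	r := rand.New(rand.NewSource(base))
	out := make([]int64, n)
	for i := range out {
		out[i] = r.Int63()
	}
	return out
}

// addRandomSeeds adds n random entries to the seed corpus, so a plain
// `go test` run (as in CI) exercises the target n times rather than once
// per hand-written f.Add call.
func addRandomSeeds(f *testing.F, n int) {
	base := time.Now().UnixMilli()
	f.Logf("fuzz base seed: %d", base) // log the seed so failures are reproducible
	for _, s := range randomSeeds(n, base) {
		f.Add(s)
	}
}
```

Deriving all n seeds from one logged base seed keeps the non-determinism debuggable: rerunning with the same base reproduces the exact corpus.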
> we could add -fuzztime to the CI to run it for a limited amount of iterations.

Are you aware of any projects using such a strategy? I feel this requires a very powerful VM to run the CI. My CPU got very hot when executing fuzz tests, and sometimes a fuzz test takes >10s just to generate the initial corpus.

> We could even do something like addRandomSeeds(f, 100) to add exactly 100 random seeds, which is what other frameworks are doing (ie. they achieve 100 passes by default for each test, rather than run forever like Go).

This is a good idea: basically, we limit the number of tries (rather than the running time) of the fuzz tests. I will look into this approach and find a way to try it in subsequent PRs. Hopefully the CI will be happy to do this.
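For reference, a bounded CI step might look like the fragment below (hypothetical workflow step and target name; note that Go's -fuzztime flag accepts both a wall-clock duration like 30s and an iteration count like 100x, so the "100 passes" convention maps directly to -fuzztime=100x):

```yaml
# Hypothetical GitHub Actions step: bound the fuzz target by iteration count
# instead of wall time. -fuzz selects one target per package.
- name: Fuzz epoching keeper
  run: go test ./x/epoching/keeper -run='^$' -fuzz=FuzzSlashedValSet -fuzztime=100x
```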
Yes, the QuickCheck family runs 100 tests by default, and it's up to you to tweak the numbers. I also want tests to finish in a reasonable time, 10 seconds at most, so if I see that they take longer I have to rein in how much data I generate.
Maybe I don't need 1000 blocks with up to 100 transactions each just to test something simple, and maybe I let my tree generation do too much branching, so that my tree of depth 10 is exponentially wide. Generating random bytes can also get out of hand, like opaque payloads: we don't need them to be hundreds of kilobytes long.
So it's a good indication of where time is not spent well.
And you are right: I'd rather know that a particular test passed 100 runs than that some VM spent one minute overall on all fuzz tests and may or may not have covered them equally.
require.Equal(t, sdk.Uint64ToBigEndian(uint64(i)), msg.MsgId)
require.Nil(t, msg.Msg)
It may be worth creating a constructor method like NewQueuedMessage(txid, msg) and letting it calculate the hash itself, so we're not able to generate invalid ones. Just an idea.
ctx = nextBlock(app, ctx)
}

// check whether the validator set remains the same or not
Okay, but why would it change? There are no transactions, and even if there were, the test doesn't say how long an epoch is.
Shouldn't the test generate random registrations and check that the validator set doesn't change during an epoch, but does change at the end?
That's a good idea! However, this requires mocking MsgCreateValidator, or even MsgWrappedCreateValidator, which includes the BLS stuff. I have added a TODO on this and will do it in subsequent PRs.
Nice, you found a lot of good techniques for generating valid-looking blocks!
Partially fixes BM-63
This PR introduces more test infra for the epoching module and adds some (stateful) fuzz tests on keeper functionalities. Specifically, it introduces the following:
- a helper for generating a BabylonApp with a number of validators
- stateful fuzz tests in epoch_msg_queue.go, epoch_val_set.go, and epochs.go

In addition, this PR removes some redundant unit tests (which can be safely replaced by the more powerful fuzzing ones) and fixes two bugs in the epoching module's BeginBlock and in the order of initialising hooks. The bugs were revealed by the fuzz tests.

Some stateful fuzz tests are left to future PRs.

One of my feelings is that some of this test infra might be useful for other modules as well, and could thus become part of the project-level SimApp in the future. This depends on the team's decision on the integration-test approach.